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Abstract 

This article is concerned with the long time behavior of neutral genetic population 
models, with fixed population size. We design an explicit, finite, exact, genealogical 
tree based representation of stationary populations that holds both for finite and infi- 
nite types (or alleles) models. We then analyze the decays to the equilibrium of finite 
populations in terms of the convergence to stationarity of their first common ancestor. 
We estimate the Lyapunov exponent of the distribution flows with respect to the total 
variation norm. We give bounds on these exponents only depending on the stability 
with respect to mutation of a single individual; they are inversely proportional to the 
population size parameter. 

Keywords : Wright-Fisher model, neutral genetic models, coalescent trees, Lyapunov 
exponent, stationary distribution. 

Mathematics Subject Classification : 60J80, 60F17, 65C35, 92D10, 92D15. 

1 Introduction 

Stochastic models for population dynamics provide a mathematical framework for the anal- 
ysis of genetic variations in biological populations that evolve under the influence of evo- 
lutionary type forces such as selections and mutations. The common models, such as the 
(generalized) Wright-Fisher models, allow for many variations in the fundamental assump- 
tions. For example, one may consider a finite population, or a sample out of an infinite 
population; there might be a finite or infinite number of types (or alleles) of individuals; the 
genetics involved may refer to monoecious (i.e. where there is only one sex) or dioecious 
phenomena, and so on. 

Dealing with finite number of individuals without sampling in an infinite population is 
usually difficult: although the problems are easily formulated, computations become soon 
very involved. Here, we are interested in finite population models where the evolution is 
driven by a selection/mutation process and derive explicit formulas for the stationary state of 
the population or the decay to the equilibrium. We do not put any restriction on the number 
of types or alleles, that may be finite or infinite. The main restriction to a full generality of 
the model is neutrality of the selection process: that is, we assume that all individuals have 
the same reproduction rate (we refer e.g. to the monograph of M. Kimura [6] for a detailed 
account on the neutral theory of molecular evolution). 
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At the molecular level, the evolutionary models we consider correspond therefore to 
situations where the genetic drifts that govern the dynamics are the mutations combined 
with a uniform reproduction rate. Genealogical tree evolution models arise then in a natural 
way, when considering the complete past history of the individuals. These path space models 
can be described forwards with respect to the time parameter. Conversely, we can also trace 
back in time the complete ancestral line of all the individuals. This backward view of the 
ancestral structures is then interpreted as a stochastic coalescent process. 

The important questions that arise then are the precise description of the asymptotic 
behavior of evolution processes, and the corresponding time to equilibrium analysis. They 
are at the core of the modern development of mathematical biology. Let us also point out 
that, apart from their importance in biology, they are also related to the convergence of a 
class of genealogical tree algorithms used in advanced stochastic engineering, and in Bayesian 
statistics. For example, a full understanding of the long time behavior of genetic type 
branching models is essential in the designing of genealogical particle filters and smoothers, 
as well as for the tuning of genetic algorithms for solving global optimization problems. 
These features of evolutionary models are another strong motivation for the present work 
and, although we emphasize mainly the applications to genetics, the interested reader should 
keep in mind these other application areas (for further informations and references on the 
subject, see for instance the pair of recent books [2j [3] ) . 

During the last three decades, many efforts have been made to tackle these questions. 
Although several natural Markov chain models of genetic processes have been developed, 
their combinatorial complexity makes both the intuitive, and the rigorous understanding 
of the long time behavior of evolution mechanisms difficult. Several reduced models have 
been developed to obtain rigorous, but partial theoretical results. These simplified models 
are usually based on large population approximations, and appropriate rescaling of the time 
parameter in units of the total population size. 

Let us review very briefly three main directions of research in the field. The first one 
studies the diffusion limit model arising, through appropriate time rescaling techniques, 
when considering gene type fractions decompositions on large population sizes models (see 
for instance the seminal book of S. N. Ethier and T. Kurtz [8], and the references therein). 
A second idea is to consider the backward ancestral lineage in infinitely many allele models. 
This genealogical process is expressed in terms of rescaled Poisson coalescence epochs, and 
superimposed Poisson mutation events. Running back in time this population model, the 
coalescent of Kingman describes the ancestral genealogy of the individuals in terms of a 
simplified binary ancestral lineage tree. The Ewens' sampling formula describes the limiting 
distribution of the type spectrum in finite samples of the infinitely many allele models (see 
for instance the Saint Flour's lecture notes of S. Tavare and references therein). A third, 
more recent, idea is essentially based on the mean field interpretation of genetic models. In 
this context, the occupation measures of simple genetic populations converge, as the size of 
the systems tends to infinity, to a non linear Feynman-Kac semigroup in the distribution 
space. For a rather detailed account of these mean field limits, we refer the reader to the 
first author monograph [2]. In this interpretation, and in the case of reversible mutations, 
the long time behavior of an infinite population model corresponds to the ground state of 
Schrodinger type operators. 

In the present article we depart from these techniques and tackle directly the combina- 
torial complexity of finite population dynamics. This leads to a fine asymptotic analysis of a 
general class of neutral genetic models on arbitrary state spaces, with fixed population size. 
In contrast to the existing literature on the asymptotic behavior of neutral genetic models, 
our approach is not based on some had-oc rescaling of the time parameter, and it applies 
to evolutionary models with fixed population size. 

Firstly, we provide an explicit functional representation of their invariant measures in 
terms of planar genealogical trees. We then use this representation to analyze the decays to 
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stationary populations in terms of the convergence to the equilibrium of the first common 
ancestor and show, for example, that the Lyapunov exponent of the genetic distribution 
flow is inversely proportional to the population size of the model. This estimate is of course 
sharper than the one we would obtain using crude minorization techniques of the transitions 
probabilities of the genetic model, such as those presented in [4]. We refer to Section [2.31 
Theorems 12.11 and 12.21 for a precise statement of the main results of the article. 

Unfortunately, these rather natural genealogical techniques are restricted to neutral ge- 
netic population models, but it leads to conjecture that the same type of decays holds true 
for more general models. Besides, some of the ideas and techniques developed in the article 
can certainly be adapted to cover inhomogeneous selection rates -this is a point we leave 
deliberately out of the scope of the present work. 

We finally mention that the genealogical invariant measure derived in the present article 
belongs to the same class of tree based measures as the ones studied in pQ to derive coales- 
cent tree based functional representations of genetic type particle models. The symmetry 
properties of the invariant measure can be combined with the combinatorial analysis pre- 
sented in pQ to simplify notably the formula for the distributions of stationary populations. 
We briefly present these constructions in the last section. 

To the best of our knowledge, these genealogical tree based representations of stationary 
populations, as well as of the corresponding convergence decays to the equilibrium, are the 
first of this kind, for this class of evolutionary models. 

2 Neutral Genetic Models 
2.1 Conventions 

Let us introduce some notation. We denote respectively by A4(E), V{E), and B(E), the 
set of all finite signed measures on some measurable space (E,£), the convex subset of all 
probability measures, and the Banach space of all bounded and measurable functions / on 
E, equipped with the uniform norm ||/|| = sup x£E The space E will stand for the 

set of types or alleles of the genetic model. 

We let n(f) = J n(dx) f(x), be the Lebesgue integral of a function / G B(E), with 
respect to a measure \x G M.(E\ and we equip A4(E) with the total variation norm \\n\\t v = 
su P/eB(£):||/||<i 

We recall that a Markov transition M from E into itself, is an integral probability 
operator such that the functions 

M(f) : xeE^M(f)(x)= [ M{x,dy) f(y) G E 

are ^-measurable and bounded, for any / G B(E). It generates a dual operator \i fj,M 
from M.(E) into itself defined by (pM)(f) := fi(M(f)). For a pair of Markov transitions M\, 
and M2, we denote by M\M 2 the composition integral operator from E into itself, defined 
for any / G B b {E) by (M 1 M 2 )(f) := Mi(M 2 (/)). The tensor power M® N represents the 
bounded integral operator on E N , defined for any F G B(E N ) by 

M® N (F)(x N ,...,x N )= [ [M(xW) ... M(x N ,dy N )] F(y\...,y N ) 

Je n 

We let N be a fixed integer parameter, and we set [N] = {1, . . . , N}. The integer N will 
stand for the number of individuals in the population. We denote by m(x) = Yli=i ^x* 
the empirical measure associated with an iV-uple x = (x*)i<i<jv £ E . The notation 8 X 
stands in general for the Dirac measure at the point x. 

We let A = [Ny-^>, be the set of mappings from [N] into itself. We associate with a 
mapping a G A, the Markov transition D a on E defined by D a (x,dy) = 5 x a(dy), with 
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x a = (x°(*))j g [jv]- The transition T> a can be interpreted as a coalescent, or a selection 
type transition on the set E N . In this interpretation, the population of individuals x a = 
(x a W) ie [7v] results from a selection in x = (x l ) ie ^ of the individuals with labels [a(i))ie[N]- 
Also notice that the iV-tensor product of an empirical measure can be expressed in terms 
of these selection type transitions : 

m(x)® N (dy) = t^Y1 ^ d y) =■ v ( x > d y) 

We consider now a Markov transition M, and a probability measure r\ on E. The Markov 
transition M will represent the mutation transition process, whereas rj will stand for the 
initial distribution of the population. An A-neutral genetic model, with these parameters, 
can be represented by a pair of E^-valued Markov chains (£ n )neN = ((d)ie[iV])neN an d 
(Cn)neN = {(Cn)ie[N])nm, together with transitions 

selection ^ mutation 
in ► in > in+1 (2.1) 

The initial configuration £o consists of A independent and identically distributed random 
variables with common distribution rj. 

The selection transition £ n ~~> £ n corresponds to a simple Wright-Fisher selection model: 
it consists in sampling A conditionally independent random variables (£n)ie[iV]> with com- 
mon distribution m(£ n ). Equivalently, the selected population £ n is a random variable with 
distribution V(i n ,dx); in other words, we chose randomly a mapping A n in A, and we set 

in = in n ( = (£n n ^)ie[N]J ■ I n the literature on branching processes, this random selection 
mapping A n is often expressed in terms of a multinomial branching rules. Assume, for in- 
stance that all the i l n are different, if we let L n = (L % n ) ie ^ be the number of offsprings of 
the individuals £ n 

Vi G [N] L^:=|{iG[A] : & = 
then, we find that L n has a symmetric multinomial distribution 

P(Ll = l\...,L» = l N \i n ) X] : 



m...i N \ n n 



for any I = (l*) ie[N] G N w , with £. £[JV] ^ = 

During the mutation, each of the selected particles i l n evolves to a new location 
randomly chosen with distribution M, with i G [A]. Equivalently, the population £ n+ i 

after mutation is a random variable with distribution M(i n ,dx), with A4 := M® N . 

The distribution laws of the populations before and after the selection step are defined 
by the values, for any function F G Bb(E N ), of 

r„, n (F):=E(F(£ n )) and f „, n (F) := E (>(£„)) 

Notice that T^o = V® N > an d ^vfi = r^o^- By construction, we also have that F VjJl = F V:n V, 
and r^ in+ i = T v<n M.. This clearly gives the dynamical structure of the pair of distributions 

r„, n+ i = r„, n (VM) and f„, n+ i = f „ >n (MP) 

In particular, we have: 

r„, n (F) = r^ (DM)" (F) = E (T vfi D Ao MD Al . . . D Anl M(F)) 

and 

f v ,n(F) = (MV) n (F) = E (T vfi D Ao MD Al . . . MD An (F)) 
with a sequence of independent random variables (A p )o< n , uniformly chosen in the set A. 
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2.2 Genealogic trees representations 

In neutral reproduction models, the pair of mutation-selection processes can be separated 
in order to describe the mutation scenarios and the neutral selection reproductions on two 
different levels. Traditionally, these neutral genetic models are sampled in the following way. 
The genealogical structure of the individuals is first modelized. Then, the genetic mutations 
are superimposed on the sampled genealogy. 

In the present section, we design a representation of this pair of genetic processes in terms 
of random trees. The neutral selections are encoded by random mappings; their composition 
gives rise to random coalescent trees. The representation of the process is complete when 
mutation scenarios are taken into account. In contrast to traditional mutation-selection de- 
coupling models, non necessarily neutral selection processes could also be described forward 
in terms of these random trees. In this situation, the random selection type mappings would 
depend on the configuration of the genes. 

We let X*™, with i n = (i , ... ,i n ) G [A r ] n+1 , and n > 0, be the collection of E- valued 
random variables defined inductively as follows: at rank n = 0, Xq\ with io = io G [N], 
represents a collection of N independent, and identically distributed random variables with 
common distribution 7]. Given the random variable X*Si , for some multi index i n -i G [X] n , 
the sequence of random variables X^, with i n = (i n _i,i n ), and i n G [N] consists in N 
conditionally independent random variables, with common distribution 5 i n _ 1 M. 

X n-1 

Following the convention of [5], the collection of random variables X p p , with i p G [iV] p+1 , 
and < p < n, can be associated to the vertices of a planar forest of height n. The sequence 
of integers i n represents the complete genealogy of an individual at level n. For instance, 
an individual X^ in the second generation is associated with the triplet 12 = (10^1,12) of 
integers that indicates that he his the i2-th child of the i\-th child of the iQ-th root ancestor 
individual. Running back in time, we can trace back the complete ancestral line of a given 
current individual X^ 1 at the n-th generation 

A Q < A 1 < ...< A n-1 < A n 

The neutral selection, and the mutation transition introduced in (|2.ip have a natural inter- 
pretation in terms of these random forests. Roughly speaking, the random forests introduced 
above represent all the transitions of a selection-mutation genetic algorithm. 

To be more precise, it is convenient to introduce some additional algebraic structures. 
We equip A = [N]^ with the unital semigroup structure associated with the composition 
operation ab := aob, and the identity element Id. We equip the set A with the partial order 
relation defined for any pair of mappings a, c G A by the following formula 

a < c <^=^ 36 G A a = be 

For any collection of mappings (a p )Q< p < n , we notice that 

a0,n < fll,n < • • • < a»i-l,n < d n < Id 

with the composition semigroup a P)Tl = a p a p +i in , < p < n; and the convention a n ^ n = a n . 
For any weakly decreasing sequence of mappings (bo, . . . , b n ) G A n+l (that is, any sequence 
s.t. bi > bi + \, we set 

^(b n ,...,bo) ■— ( j^b n (i),...,bi(i),bo(i)\ 

n V 71 /l<i<7V 

In this notation, it is now easy to prove that the neutral genetic model can be seen as a 
particular way to explore the random forest introduced above 

,-, selection A mutation A rJ selection a a a mutation a a a 
■vld , vA) , vAoJa v AoAi,Ai -^AoAi^iJd 
A.q > SLq > A~y > sv-y ' ^2 ' ' ' 
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with a sequence of independent random variables (^4 p )o<n; randomly chosen in the set A. 
More generally, the distribution laws of the neutral genetic model are given by the formulae 

r( T?\ TT? i ( vA.Q n7 A± n ,...,A n — l,nAn 
lj,n{-T ) = ^ij I # \^-n 



with the semigroup A Pi7l := A p A p+ i tn , with < p < n, and the convention A n ^ n = Id. In 
much the same way, we also have that 

r,, n+ l(F) = f v , n M(F) = E„ (F f X Ao,n,Al,n,---,A n -l,n,A n M 



2.3 Asymptotic behavior 

The permutation mappings a G Qn are the largest elements of A, while the smallest elements 
of A are given by the constants elementary mappings ej(j') = i, for any j £ [N], with 
1 < i < N. The set of these lower bounds is denoted by 

dA = {a £ A : \a\=l} = { ei : i G [N]}, 

where \a\ stands for the number of elements in the image of a. We also let (-B p )o<p<n be 
the weakly decreasing Markov chain on A defined by 

Vn > 1 B n := A n B n ^ (2.2) 

with the initial condition Bq = Aq, where (^4 p )o<p<n stands for a sequence of independent 
and uniformly distributed random variables on the set of mappings A. We let T be the first 
time the chain B n enters in the set dA. 

T := inf {n > : B n £ dA} (2.3) 

In terms of genealogy, the Markov chain (|2.2p represent the ancestral branching process from 
the present generation at time back into the past. In this interpretation, the mapping A n in 
formula (|2.2p represents the way the individuals choose their parents in the previous ancestral 
generation. The range of A n represents successful parents with direct descendants, whereas 
the range of B n represents successful ancestors with descendants in all the generations till 
time 0. The random variable T represents the time to most recent common ancestor of an 
initial population with \Bq\ individuals. 

We are now in position to state the two main results of this article. 

Theorem 2.1 The Markov chain B n is absorbed by the boundary dA in finite time, and 
Bt is uniformly distributed in dA. In addition, for any time horizon n >0, we have 

PfT > n) < K ( — V l) exp (- ( — - l) with K := e TT (l- -, % 

The Theorem follows from (14. ljl . 



We further assume that the Markov transition M has an invariant probability measure 
/ion£, and we denote by the probability measure on E N defined by 

VF G B b {E N ) f^F) := E M (f (** T ' - > Bl ' B °)) (2.4) 
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Theorem 2.2 The measure r„ is an invariant measure of the N -neutral genetic model £„. 
In addition, if the mutation transition M satisfies the following regularity condition : 

(H) There exists some A > 0, and some finite constant 5 < oo such that for any n > 

f3{M n ):= sup \\M n (x,.)-M n (y,.)\\ tv <6 e~ Xn (2.5) 
(x,y)£E 2 

then, there exists K' > (a universal constant) such that for n > N + j, we have the 
estimate 



T — T 

x rj,n 1 fi 



< SK'^-exp f r 



This Theorem follows from Theorem 12. 1\ Equation (|3.5p and Corollary 14.21 Notice that the 
regularity condition is a weak condition, most often satisfied in selection-mutation genetic 
models -degenerate cases should be avoided such as, for example, in the finite state case, 
the splitting of the matrix representation of M into a direct sum, or the existence of several 
fixed points of the Markov transition. The regularity hypothesis is satisfied for example if 
M(x,dy) = a5 x (dy) + (1 — a)fj,(dy) with < a < 1 and /j, a Lebesgue-type density on E. 

We end this section with some consequence of these two theorems. Firstly, we notice 
that the first assertion of the theorem can alternatively be expressed in terms of the measure 
T M defined by 

VF G B h {E N ) T„{F) := E M (f ( X ^t,..,b u b ,u 

More precisely we notice that, by construction, T^{F) = T^MF). This readily implies 
that is an invariant measure of the neutral genetic model £ n . Indeed, we have 

T^{VM(F)) = Y^MV{MF)) = f^M{F) = T^F) 

The reverse assertion follows the same line of arguments. 

Notice also that the second assertion of the theorem can be used to estimate the Lya- 
punov exponent of the distribution semigroup of the neutral genetic particle model; that is 
we have that 

lim inf log 



+oo n 



T — T 



A 1 

> A — 

tv - XN + 1 N 



3 Random mappings and coalescent tree based measures 

Recall that we denote by \a\ we denote the cardinality of the set a([iV]). Notice that 
a < c \a\ < \c\, and the cardinality of the set 

A c ■= {a G A : a < c} 

coincides with the number of ways iVl c l of mappings the range c( [N] ) of the mapping c into 
the set [N]. Also observe that \dA\ = N, and A Bi = OA, for any i £ [N], where \5A\ stands 
for the cardinality of 5 A. In this notation, the transitions probabilities of the chain B n 
introduced in (12. 2p are given by 

F(FJ n = b | B n _i = a) = -L l Aa (b) (3.1) 

It is also readily checked that the uniform distribution z^o(a) := oj on ^4 is such that 

vo(a) = K(Id,a), and we also have that K{ei,b') = i I&a^')) where K stands for the 
Markov transition on A defined by 



Kty) :a£A^ K(cP)(a) := -L ^ <f>(ba) 



1-4 b ,A 
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for any function <f> on A. This clearly implies that the uniform measure v\(a) := jj1qa(cl) 
on the set dA is an invariant measure of K. That is we have that v\ = v\K. 

For any weakly decreasing sequence of mappings b = (60, • • • j b n ) G A n+1 , with a finite 
length /(b) = n, we write 

|b| := £ |& P | 

0<p<n 

For any c = (co, . . . , c n ) G ^4 n+1 , any mapping a £ A, and any 6 > Co, we also write 

(b, c) = (6, c , . . . ,c n ) ca : = (c a, . . . ,c n a) and c * a := (c a, . . . , c tn a) 

where t n = inf {0 < p < n : c p a G <9*4}; with the convention i n = n, if c p a 9^4 for any 
< p < n. 

The set of all weakly decreasing excursions from A into dA is given by 

C := U n >oC n 

with the sets C n of all weakly decreasing excursions c, with length /(c) = n > 0, and defined 
by 

Cn = {c = (c , . . . , Cn) G (^l - dA) n x dA : V0 < p < n c p > c p+1 } 
We use the convention Co = dA, for n = 0. We associate with the random excursion 

B = (Ab Bi, . . . , Bt) G C 

the pair of random excursions 

B'=(W,B)eC and B'*AeC, 

where ^4 stands for a uniformly distributed ^4-valued random variable. 

Lemma 3.1 The Markov chain B n is absorbed by the boundary dA in finite time, and Bt 
is uniformly distributed in OA. Furthermore, the excursions B and B' * A are distributed 
on C according to the same distribution p : c G C 1— » p(c) := l/jV^+C ' -1 ). 

Indeed, using the very crude upper bound 

/ j x n+1 

F(T > n) < P(V0 < p < n A p dA) = ( 1 - ^ij (3.2) 

we readily check that the chain B n is absorbed in the boundary subset dA in finite time: 
P(T < 00) = 1. Furthermore, by symmetry arguments, the entrance point Bt is uniformly 
distributed in dA, that is we have that F(Bt = a) = v\{a). 

From the computation of the conditional expectation in Eq. 13.14 it is easily checked that 
p is the distribution of the excursion B. Notice that it is a well defined probability measure 
on the set of excursions C since 

p(C n ) = P(T = n) p(C) = P(T < cxd) = 1 

By construction, we have 

B' = (B' , ...,B' T , B' t+1 ) = (Id, B) = (Id, B ,..., B T ), 

with T = {inf n > B n G dA}. This implies that 

B' *A:= (B' A, B[A, B' S A) = (A, B A, B X A, B S -iA) 



8 



with the stopping time 

5 = inf {n > : B' n A £ &4} = inf {n > : B n -iA £ &4} 

and the convention B-\ = Id. The lemma follows since, by the very definition of B n , the 
Markov chain B n _iA, n > has the same distribution as the Markov chain B n . In more 
concrete terms, and for further use, we notice that, for any test function / we have: 

E(/(B' * A)) = E (f( B -i A > B oA B\A, B n . x A) l s=n ) 
n>0 

n—1 j Cn ) 

n>0 (c ,..,c„)eC„ 

= £ /(c) p(c) = E(/(B)) (3.3) 

cgC 

(where v stand for the uniform probability measure on A). ■ 



Definition 3.2 We associate with a probability measure rj £ V(E), and a weakly decreasing 
sequence of mappings b = (bo, . . . ,b n ) £ A n+1 , a probability measure W^,...^) £ V(E N ) 
defined for any F £ Bb(E N ) by 



V(b ,...,b n )(F) :=E V (F (X^ > b ° 

By the definition of the neutral genetic model, for any weakly decreasing sequence of map- 
pings b in A, and any mapping a £ A we have 

VbM = V(Id,b) and r, h D a = r] ha (3.4) 

In addition, for any excursion c £ C, any path a = (a%, . . . , a m ) £ dA m , and any a £ A we 
have 

»7(c,a) = {v Mm )c H{c,&) = Mc and therefore /x ca = /i c * a 

Since (Aq, . . . , A n ) has the same distribution as the reversed sequence (A n , . . . , Aq), we also 
find that 

rt 77A TT7 1 ( T? I V^O.Tli^l.nr-^t-l.ni^fl 



_ ^A„...Ao,A n _i...A ,...,^i J 4o,A ^ 

= E, [F (^B tt ,B tt _ 1 ,...,B 1 ,Bo)) = E (^ iBn) (F)) 

Proposition 3.3 // i/ie Markov transition M has an invariant probability measure \i on E, 
then the probability measure 

f,(F) := J>(c) Mc(^) =E(/x (Boi ... iSt) (F)) 
cec 

zs an invariant measure of the neutral genetic model (£ n ) n >o- Under the regularity condition 
(H), the neutral genetic model has a unique invariant measure T^, and we have the estimate 



Vra > 



r — r 



< 5 E(e" A ^ T ) l T <n) + 2 P(T > n) (3.5) 



TOt/i the pair of parameters (5, A) introduced in (E 
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Indeed, if we take the excursion B = (Bq,B\, . . . , -By), then by (|3.3p we find that 

E (tivM® N D A (F)) = E( mtB) D A (F)) 

= E ^ {m ,b)a(F)) = E {n {Id>B>A (F)) = E (M^)) 

Since E(/xg(.F)) = r„(i^), the end of the proof of the first assertion of the theorem is 
completed. To prepare the proof of the second assertion, firstly we notice that 

r, {Cja) (F) = E v (F (X^""^)) = E,m (F {x^^Jj = faM) c (F) 

for any excursion c = (co,...,c n ) G C, with lenght 1(c) = n, and any a £ dA. More 
generally, for any path a = (01, . . . ,a m ) £ dA m , we have 



*7(c,a) = {riM m \ 



This clearly implies that 



T v , n (F) = E {ri( Bo ,...,B n )(F) 1 T <«) + E (rj(B ,...,B n )(F) l T>n ) 

= E((r ? M- T ) (Boi ... jBT) (F) l T < n ) +E(r/ (B0) ... )Bn) (F) 1 T >„) 

To take the final step, we consider the decomposition 

f „, ft (F) — Tfi(F) = E ( [(vM n - T ) {Bo ,...,B T ) ~ ^(flo Br)] 1t<„) 

+E(7 7(Bo ,... iBn) (F) - M(B0! ... )Bre) (F)) l T>n ) 

For any excursion c = (co, • • • , c ra ) € C, with lenght /(c) = n, we notice that 
[77c - Mc ](F) = / [r? - M ](cfe) x E 5l (F fc c — co ))) 

J E 

Since /i = /iM n_T , applying the majoration 12.51 we get 

f„ >n (F) - P M (F)| < 5 E(e" A ( n - T ) l T <n) + 2P(T > n) 
for any ||.F|| < 1. This ends the proof of the proposition. 



4 Absorption times behavior 

We study here the renormalized time T/N for large integers N, where T is defined in (|2.3p . 
when the mapping-valued Markov chain (B n ) n€ ^ has the identity as initial state. Let us 
recall that T can be interpreted as the time a neutral genetic model with N particles has to 
look backward to encounter its first common ancestor. The main result is the convergence 
in law of T/N, which shows that T is of order N, but we will also be interested in more 
quantitative bounds in this direction. 

To begin with, we notice that T only depends on (|-B n |)neN an d that this [iV] -valued 
stochastic chain is Markovian, with transitions described by 

V n£n,V l<p<q<N, n\Bn+l\ = P | |B n | = q) = %,p)^f 

(where S(q,p) is the Stirling number of the second kind giving the number of ways of 
partitioning the set [q] into p non empty blocks and where (N) p := Nl/(N — p)\) and 
starting from N if Bq = Id. 
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This observation leads us to define for N G N*, a triangular transition matrix by 



Vl<p,g<iV, Mjf := S^p) 



Ni 



and to consider for any 1 < i < N, a Markov chain R^ N ^ := (Rn'^)neN starting from i 
and whose transitions are governed by M^ N \ Such a Markov chain is (a.s.) non-increasing 
and 1 is an absorption state. We denote 

s (N,i) ._ inf { n e N . 2j(JV,i) = X } 

so that T has the same law as S^ N ' N '. The goal of this section is to prove the 

Theorem 4.1 Let (ijv)jveN* be a sequence of integers satisfying 1 < i^ < -/V /or an?/ JVgN* 
and diverging to infinity. Then the following convergence in law takes place for large N 



S( n ^n) 
N 



^(1 + 1 



(/ + !)(/ + 2) 



6 



where (£/)/gN * s an independent family of exponential variables of parameter 1. Furthermore, 
we have for any fixed < a < 1, 

sup E[exp(aS^ N ' iN) /N)] < exp(a)n(a) 

TVeN* 



where 



V < a < 1, 



11(a) 



2a 



ieN (Z+l)(Z+2) 



is i/ie Laplace transform of the above limit law. Thus for any continuous function f : 
M+ verifying lim Q _ + i_ limsup s ^ +00 exp(— as)f(s) = 0, we are insured of 



lim E\f(S^ N ' iN) /N)] = E 

N^oo 



! E 



(Z + 1)(Z + 2 



-5/ 



In particular, for any given < a < 1, via a Markov inequality, we deduce the exponential 
upper bound 

VJVGN*,Vl<i<JV,VnGN, F[S^ N ' l) > n] < exp(a)II(a) exp(-an/iV) 

K 

< exp(— an IN) 

1 — a 



with K = e (n, eN . (1 - (7+l)(7+27 
1, we obtain that for any positive integers n and N, 



Optimizing this inequality with respect to < a < 



V 1 < i < N, F[S {N ^ >n] < K V l) exp(-(n/iV - 1)+) (4.1) 

(of course this bound does not give any relevant information for n < N, when the l.h.s. 
probability is not small). As it was explained in the second section, such an inequality is 
useful to estimate convergence to equilibrium for neutral genetic models and the results 
presented in the introduction follow from it. 

Indeed, in view of proposition 13.31 we still need another estimate, but it is an immediate 
consequence of the above bound: 
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Corollary 4.2 There exists a constant K ' > such that for any A > 0, any JV £ N*, any 

1 < i < N and any n > N + 1/X, we have 



E 



exp(-X(n-S^))l {siN ,^ n} ] < K'^exp 



1 + XN ) N 



Indeed, let m € [n] be given, we have 



E 



exp(-A(n-S^)))l {5Wi) <„ } 



E 



exp(-A(n - ^)))l {5(iV , l) < m} j +E [exp(-A(n - ^))l {m < 5 wo<„} 



< exp(-A(n - m)) + P(5 Wi) > m) 



< exp(-A(n — m)) + if y— V 1 ) exp(-(m/iV - 1) 



m 



n 



< (1 + K)— max(exp(— A(n — m)),exp(—(m/N — 1)+)) 

Where we have used that m < n and that n > N. Optimizing the last term, we are led to 
consider 



m 



1+A7V 



Note that this integer number is larger than N for n > N+l/X and the corollary's inequality 
follows easily from the obvious bounds 

n . n 



!+A7v 



— 1 < m < 



!+A7v 



Remark 4.3 The limit distribution appearing in Theorem \4-l\ is the same as the law of 
the coalescence time for the Kingman process (see for instance This could have been 

expected, since it is known that, suitably "rearranged" , the mapping-valued Markov process 
(-Bnytl )teR+ converges to the Kingman coalescent process for large N. Nevertheless, at 
our best knowledge, this convergence takes place in a weak sense which does not permit to 
deduce the results presented here. In fact, the latter could serve to strengthen the previous 
convergence. 

Before proving Theorem 14.11 we will investigate the simpler problem where only negative 
jumps of unit length are permitted. More precisely, let M^' be the transition matrix 
defined by 



V 1 < q,p < N, 



g,p 



1 




N q 



if p = q 

if p = q — 1 
otherwise 



Every corresponding notion will be overlined by a tilde. Then we have a result which is 
similar to Theorem 14.11 with nevertheless some slight differences: 

Proposition 4.4 Let (i^N^N* be a sequence of integers satisfying 1 < ijy < N for any 
N G N*, diverging to infinity and such that a := lirn/v— >oo i-N /N exists in [0,1]. Then the 
following convergence in law takes place for large N 



N 



(i+m 
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where (£i)i£N is an independent family of exponential variables of parameter 1. Furthermore, 
we have for any fixed < a < 1, 



sup E 



exp q- 



N 



n(a) 



The case of a fixed initial condition, i.e. when there exists i G N \ {0, 1} such that for any 
N > i, ijy = i, is also instructive, despite the fact it is not included in the previous result: 

Lemma 4.5 For given i G N\ {0, 1}, the following convergence in law takes place for large 
N, 



S (N,i) 

N 



E 



0<Ki-2 



(Z + !)(! + 2) 



Si 



where the £\, for < / < i — 2, are independent exponential variables of parameter 1. 
Furthermore, we have for any fixed < a < 1, 



sup E 

N€N*,N>i 



exp a- 



N 



- n t 



0<Ki-2 



2a 

(Z+l)(Z+2) 



o, 



Let 2 < i < N be given. By a backward iteration and with the convention that 
we define for 1 < j < i — 1, 

5f° := infjnGN:^) = j 
In words, is the time necessary for the Markov chain (Rl^'^) ne ^ to jump from j + 1 to 



j. For 1 < j < i — 1, let us also denote T, 



(N) 



1 andpjvj := AfjJ3 J = (N) j+1 /N^+\ 



It is clear that T> is distributed as a geometric law of parameter Pnj, namely, 

VmeN, P[rf°=m] = {l-PNMj 
Furthermore, the variables Fj N \ for 1 < j < i — 1, are independent and we can write 



E 



T; 



(AT) 



i<i<*-i 



(4.2) 



l<j<i-l 

This leads us to study the individual behavior of the summands: 



Lemma 4.6 With the above notation and for a fixed j £ N* , we are insured of the following 
convergence in law as N goes to infinity, 



(AT) 



N 



3(3 + 1) 



£ 



where £ is an exponential variable of parameter 1. Furthermore, we have for any fixed 
< a < j(j + l)/2, 



sup E 

NeN*,N>j+l 



T. 



(AT)- 



exp a- 



N 



lim E 

N-kx 



exp a- 



N 



I 2a 
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The simplest way to prove the Lemma seems to resort to Laplace's transform. So we 
compute that for any a£l, 

E[exp(a7f)/A0] = ( id^fc , if exp(a/iV) m , < 1 

[ +00 , otherwise 

The condition exp(a/N)pNj < 1 is equivalent to 



a < -N £ ln(l-A 

l<k<j 



and taking into account the convexity inequality — ln(l — x) > x, valid for any x < 1, we 
get that it is in particular fulfilled for < a < j(j + l)/2. 

Furthermore, using an asymptotic expansion of pjvj m we show without difficulty that 
uniformly for a on any compact set of (— 00, j(j + 1)/2) (in particular in some neighborhoods 
ofO), 

lim 1 



JV=So 1 - exp(a/N)p Nd 1 - JI |^ Ty 

Since for a < j(j + l)/2, the above r.h.s. coincides with the Laplace's transform of jrppfj S, 
a well-known result (see for instance Theorem 0.5 of [12] . indeed, as we are working with 
nonnegative random variables, only a right neighborhood of is required) enables us to 
conclude to the announced convergence in law. 

Concerning the upper bound, we begin by remarking that by a previously mentioned con- 
vexity inequality, we have 

V N > j, p N j < exp 1 7 "' 



2N 

Next, we notice that for a > 0, the mapping 

1 - t 



[0,exp(-a/iV)) 3 t 



1 - exp(a/N)t 
is increasing, so that 

l-p NJ < 1 - ex P(-^) 



1 - exp(a/N) PN)j 1 _ eX p( a/A r) exp 

Finally, we consider for fixed x > 0, the function 

if : [0,x]By x(l — exp(y — x)) + (y — x)(l — exp(— x)) 

A variation study shows that it is increasing up to some point y x belonging to (0, x) and 
decreasing after this point. Since cp(0) = tp(x) = 0, we get that cp is nonnegative on [0,x]. 
Thus we obtain that 

1 — exp(— x) x 
V0<y<x, ^ < 



1 — exp(y — x) x — y 

Applying this inequality with x = j(j + 1)/(2N) and y = oc/N, it appears that for < a < 
j(j + l)/2 and N > j + 1, 



E 



exp a— 



1 

< 



2a 
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Since the l.h.s. was already seen to converge to the r.h.s. for large N, we can conclude that 
the desired equalities hold. 



The proof of Lemma 14.51 follows at once from the decomposition (|4.2p and Lemma 14.6 



The proof of Proposition 14.41 is based on arguments that are close to the preceding ones. 
More precisely, similarly to (|4.2p . we can write 



SW») - ijv + 1 

Thus we get for any a < 1, 



E 



exp a- 



N 



l<j<iiv-l 



n 

l<j<iN-l 



(N) 



1 - PN,j 



< 



n 



1 — ex.p(a/N)pNj 
1 



1 



2a 



i<j<*jv-i J'O'+i) 

< n(a) 

Let i £ N* be fixed (at first). By assumption, for N large enough, we will have ijv > i and 
quite obviously, S^ N ' lN ^ — in + 1 will stochastically dominate S^ N '^ + which implies, 
for < a < 1, 



E 



exp a- 



SWn) -i N + l 



N 



> E 



exp a- 



JV 



and thus 



lim inf E 

N—too 



exp a- 



A/ 



> lim E 

AT— >oo 



exp a- 



II 



2a 



l<i<i-i iO'+i) 



Letting i go to infinity, it appears that 



lim E 

AT— >oo 



exp a- 



S(^n) -i N + i 



N 



11(a) 



and the last part of Proposition 14.41 follows. We also obtain that for < a < 1, 



lim E 

jV— >oo 



exp a- 



iV 



exp(aa)II(a) 



and to conclude to the announced convergence in law, it remains to check that above con- 
vergence is uniform in some compact right neighborhood of 0. But this is a consequence 
of Dini's theorem, since the r.h.s. is continuous in a £ [0, 1) and for all fixed JV 6 N*, the 
mapping [0, 1)3oh E[exp(a5( 7V '^)/A^)] is increasing. 

We now proceed to the proof of Theorem 14.11 Its second part is the simplest one, since 
g(N,i N ) j g stochastically dominated by S^ N ' lN ^: 



VneN, P[s( n m) > n ] < F[S {N ' iff) > n] 
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This fact is based on two observations: on one hand, assuming that the Markov chain 
ft{N,i N ) - g ^ n s t a t e j g [jjy] at some time n, it will wait the same time to jump out of it as 
R( N ' lN \ but R( N ' lN } will visit less states in [i^] than R( N ' lN K Taking into account these two 
facts, the above inequalities follow immediately. Details are left to the reader: one can e.g. 
construct a coupling between R( N ^) and rWn) such that F[S^ N ' iN ^> > 5 (iV ' iiv) ] = 1 (for a 
general reference on the subject, see for instance [9]). 
In particular we get that for any a > 0, 

sup E[exp(aS (N > iN) /N)\ < sup E[exp(aS (7V ' iJv) /N)\ 

TVeN* TVgN* 



so the wanted upper bound follows from that of Proposition I4.4L 

We will need to work more to obtain the convergence in law. Heuristically the proof is 
based on the fact 

• that the Markov chain R( N ' lN ^ will rapidly reach some point negligible with respect 
to N (but however going to infinity with N) 

• and that from this point to 1, R( N > iN > and R^ N ' tN ^ are quite similar, which will enable 
us to make use of Proposition 14.41 

We begin by showing the second assertion, namely that for not too large ijy, R( N > iN > and 
are almost the same. The following lemma will enable us to quantify the "not too 

large" . 

Notation 4.7 In the following, we will sometimes drop the superscript (N, i) in R( N '^ and 
T (N,i) £g defined below) when no confusion is possible, in order to make the proofs more 
readable. 

Lemma 4.8 There exists a constant \i > such that for any 2 < q < N/2, 

V n G N, V i G N*, F[R^<Ri N ^-2\R^=q] < X ij^ 
Indeed, by definition, we have for any 2 < q < N and n E N, 

^[Rn+l < Rn — 2|-R n = q] = 1 — W[Rn+l = Rn\Rn = Q] ~ ^[Rn+1 = Rn ~ l|-Rn = l] 

1 " -^(S(q,q)(N) q + S(q,q - 1) (#),-! ) 
N-g + l+ q{q - 1] 




l<Kg-2 

But we note that there exists a constant x > such that 

V < i < 1/2, ln(l-x) > -x- X x 2 

thus we get 

;2 



E Hi-w > - £ (ji+xw) 

(g-2)(g-l)(2g-3) 



l<Kq-2 l<K<?-2 

(g-2)(g-l) x 
2iV 6iV 2 



(g-2)(g-l) X 3 
2iV 3iV 2 9 
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Taking into account the convexity inequality exp(x) > 1 + x, valid for all the wanted 

probability is then bounded above by 

' _ (g-2)(q-l) _ x 3U, , (g-l)(g-2) 
2N 3N 29 J V 2N 

4 

expression which is dominated by Xijb f° r an appropriate choice of the constant xi- 



Let us remark that the previous lemma enables us to get useful estimates on natural cou- 
plings between R( N ' lN ^ and R( N ' lN \ when ijy is quite small in comparison with N. More 
precisely, we begin by constructing R( N ' lN ^ and we define 

tfNte) in f{ n > o : < - 2} 

Then we take r\ NM) = R { n N ' iN) for n < and R%% +1 = ~ L Next ' we 

use the transition matrix M^ N ^ to construct (Rn' iN ^) n> ^(N,iN) in a traditional way (and 
independently from the above constructions). It is easy to check that this gives a coupling 
between rW*) and rW") and we have for any An > 0, at least if ijy < N/2, 

F[3 n e N : R^^ n) ^ kY M) ] 
= P[f(^) < S(N,i N )} 

< F ^{N,i N ) < s(N,i N ), S(N,i N ) < A N N] + ¥[S{N,i N ) > A N N] 

< A N N max + W[S(N, i N ) > A N N] 

= XiA N l f+nS(N,i N ) >A N N] 

Thus it appears that if the sequence (ijv)iVeN* of initial states diverging to infinity satisfies 
furthermore 

lim = (4.3) 

then we can choose a sequence (An)ngn* diverging to infinity and verifying 

lim A N % = 



so that 



In particular 



lim P[fl(^) + R( N ^] 
N—>oo 



lim PpW*") + S^'^} = 

N^oo 

and we get from proposition l4.4l (applied with a = 0, because (|4.3|) implies that limjv^oo ijy/N 
0) that 

' + !)(/ + 2) ' 

To treat the general case, we now turn our attention toward the first assertion before 
Lemma 14.81 (that the Markov chain R( N > iN ) will rapidly reach some point negligible with 
respect to N). So for 1 < j < i < N, we consider the reaching time 



T- 



(JV,0 



inf{n G N : R { n N ^ < j} 



3 

It will be very useful to know that when Ry^M is jumping through j, it is not going too far 
away from j. The next lemma gives an estimate in this direction: 
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Lemma 4.9 There exists a constant \2 > 0, such that for all 4 < j < V N and all j < i < 
N, we have 

,■4 



^ < 3/2 



3 



, J 



Let us prove the Lemma. For simplicity, we write k := \J/2\ , and for 1 < I < k, j < m < N 
and n 6 N, we consider the quantity 

F[R n+1 = l\R n = m] S(m,l) (JV), 



F[R n+1 = I + k\R n = m] S(m, I + k) (N) k+l 

We remark that S(m, I) < S(m, I + k)S(l + k,l), because if we have a partitioning of [m] into 
I + k blocks (that we can order through their respective smaller elements) and a partitioning 
of [I + k] into I blocks, we can naturally construct a partitioning of [m] into / blocks by 
composing them and this mapping is clearly onto. Thus, since S(l + k,l + k) = 1, we have 

F[R n+1 = l\R n = m] < F[R n+1 = l\R n = I + k] 



F[R n+1 = I + k\R n = m] ~ F[R n+1 = I + k\R n = I + k] 
But the last denominator is 

expression which is bounded below by a positive constant, uniformly in N £ N and I + k < 
j < y/N (as it was seen in the proof of lemma |4T8|) . Since k > 2, we have 

F[R n+1 = l\R n = I + k] < F[R n+1 <l + k-2\R n = l + k] 

and we have seen in lemma 14.51 that this quantity is bounded above by j 4 /N 2 up to an 
universal constant, for I + k < j. Thus there exists a constant x > such that for any 
N, k, I, m, n as above, we have (since for 4 < j < yN, we have \/N < N/2), 

P[i? n+1 = /|i? n = m]< x j^F[R n+1 = I + k\R n = m] 
and summing these inequalities for 1 < I < k, we get 

P[i? n+1 < fc|i? n = m] < X^[k < R n+1 < 2k\R n = m] 

< xj^nRn+i < 2k\R n = m] 

This bound can also be rewriten 

f 

F[R n +i <k,R n = m, Tj =n\ < Xj^[Rn = m, tj = n] 
and summing over all n E N and j < m < N, we obtain finally the wanted inequality. 

■ 

With this estimate at hand, we will be able to take advantage of an approach inspired by 
section 1.4 of the book [10] of Motwani and Raghavan, to investigate the expectation of 

3 

Lemma 4.10 There exists a constant xz > such that for all 4: < j < i < N with j < N 2 / 5 , 
we have 

E[r] '} < X3 — 
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Thus the Markov chain R( N >W goes relatively fast from N to LA'" 2 / 5 J . 

To prove the Lemma, notice first that, by definition of the chain R, we find that for any 
n G N and any 1 < q < N, 

E[N- R n+1 \R n = q\ = ^~ q Y. S ^P) ( N ~ P) 

i<p<<? 

Using the fact that (N) p (N - p) = N (N - l) p , we find that 

E[N-R n+1 \R n = q] = ^ ^ S(q,p)(N-l) p 

i<p<<? 
(N — l) q 

w- 1 

1 \i 



N[1 ~N 



This implies that for any 1 < q < N, 



E [R n - R n+1 \Rn = q] = E [q - N + N - R n+1 \ R n 



= 1~ N + N {'-N 
We are thus led to study the function defined by 

1 



Vs>l, g(s) := s-AT + AT^l ^ 
and will show that there exists a constant x such that 



(s - I) 2 

ViV>2, Vl<s<iV, g(s) > -A- (4.4) 



XN 



Indeed, we compute that 



Vs>l, g"{s) = ATln 2 (l-i-Vl-i- 



N J V N 

and we see without difficulty that there exists a universal positive constant such that the 
r.h.s. is bounded below by l/(xN) for N > 2 and 1 < s < N. Furthermore, we have 



</(!) = l + (iV-l)ln(l-i") 



and as a function of N £ [2, +oo[, this quantity is decreasing, so that it is positive since its 
limit in +oo is 0. 

Now, the lower bound (|4.4p follows from a second order Taylor-Lagrange formula. As a 
by-product of the above facts, we deduce that g is increasing on [l,+oo), since g'(l) > 
and g" > 0. Furthermore, we note that a reverse bound is valid: 

ViV>2,Vs>0, g(s) < ^ (4.5) 



Indeed, we have for any N >2 and r > 0, 



g(rN) = jVlr-1+^1-- 
< N (r - 1 + exp(-r)) 



19 



but for r > 0, it is easily seen that the expression between parentheses is less than r 2 /2, so 
we get (|4.5p by replacing s by rN. It also follows from (|4.4p and (j4.5j) that there exists a 
constant % > such that g(l + 1) < xg{l) holds for any I > 2. 
Next we consider the stochastic chain M defined iteratively by 



VtiSN, 



M„ := n 



+ ^ 9(1) 



1<1<R„ 



Let us check that it is a supermartingale with respect to the filtration (J- n )neN naturally 
generated by the Markov chain R. It is clearly adapted, so for n G N, we compute, via the 
Markov property and the fact that g is increasing, that 



E[M n+ i - M n \F n ] 
= E 



< E 



i? n — -R n +1 



1 



9{Rn) 



9(1) 



But the last rhs is zero by definition of g, since it can also be written 



E 



1 



Rn ~ Rn+1 



Rn 



g(Rn) 







Next we apply Doob's stopping theorem to the nonnegative surmartingale M with re- 
spect to the stopping time Tj, to get 



E[M T ,] < E[M ] 



But we note that on one hand, 



E[M ] 



and on the other hand, by definition, 
so it appears that 



y — 



1<Z<_R T . yv ' 



< 



X 



E (_£.iflTT) 



< P 



5(s + 1) 



< XE I x / ^ds 



./It, 
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Thus it remains to evaluate the last expectation. To do that, we decompose it into two 
parts: 

The last term is bounded above by 2/j, while by lemma 14.101 the first term is of order 
j /N 2 , up to an universal constant. Since by our assumption we have j 4 /N 2 < 1/j, the 
annouced estimate follows at once. 

■ 

Now we can continue the proof of Theorem 14.11 let us consider 

ViVeN*, j N := ijv A [iV 1 / 5 ] 

and to simplify notations, let us write tn := t-^' if jn < *jv an d tjv = if jjv = in- 
Then we can decompose S^ N,lN ^ into rjy + tn, where 

r N := M{n € N : R^ = 1} 

From lemma 14.101 we see that tn/N converges in probability to zero for large N. Thus we 
are led to study the behavior ofr^/N for large N, and we begin by noting that conditionally 

to R^' iN \ tn has the same law as S^ N ' Rt n \ So we have to check that R^' iN ^ is not too 
small and this is insured by lemma |4"U1 

To conclude, we remark that by previous estimates, we already know that the family of 
the distributions of the S^ N ' ZN ^/N (or equivalently of the tn/N), for N 6 N*, is relatively 
compact for the weak convergence. So we just have to verify that for any converging sub- 
sequence, the limit coincides with the law of Y2ieN (i+i)(l+2) ^ 1 ' ^ or ^ s purpose, we can 
furthermore assume, up to taking again a subsequence, that the subsequence at hand is 
indiced by an increasing sequence (N p ) p ^ of integers verifying jm p < in p and such that 

EV /B < +°° 

Then by Borel-Cantelli Lemma and Lemma 14.9} we have that a.s., Rt n ^' Np ^ is diverging to 
infinity for large p. But conditionally on that, we are brought back to the situation where 
(|4.3p is satisfied (replacing ijv by R^,*' JVp ^), so Theorem 14.11 follows. 

Remark 4.11 To avoid the above a.s. argument, we can re-examine the proof of propo- 
sition and see that instead of deterministic initial conditions (ijv)iVeNj we could have 
considered random initial conditions (ijv)jveN (such that for each N £ N* ; i^ is independent 
from the randomness necessary to the evolution of the Markov chain R( N > lN> ) ) diverging to 
infinity in probability. Thus, via lemma \4-5\ to obtain the wanted convergence in law, it 
is sufficient to show that R^' 1 ^ is diverging to infinity in probability. But this is an easy 
consequence of lemma \4-9\ 

5 Planar genealogical tree based representations 

As we promised in the end of the introduction, this last section is concerned with designing 
a description of the invariant measure in terms of planar genealogical trees. Most of the 
arguments are only sketched, since they follow the same lines as the ones developed in [1]. 

We start by recalling that for any pair of mappings a < b, we have a = cb for some non 
unique c G A. The mapping b induces the following equivalence relation ~& on A 

c ~6 c <^=> cb = c'b 
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Equivalently, c ~& d if and only if the restriction of c and d to the image of a are equal, 
so that the equivalence class c G >/4y~ 6 can be seen as the corresponding map c from 6([iV]) 
into [N]; and we therefore have 

\A^ b \=NW 

In terms of backward genealogies, for a given fixed pair of mappings a < b, the mapping c 
represents the way the individuals b([iV]) choose their parents, and the composition mapping 
a = cb represents the way the individuals [N] choose their grand parents in a([iV]). Notice 
that in this interpretation, not all the individuals in the previous generations are ancestors, 
the individuals that do not leave descendants are not counted. 
We further suppose that we are given a function 

9 : (a, b) G {(a', b') G A 2 : a' < b'} ^ 9{a, b) G M 

such that 6(a,b) = 6(a',b'), as soon as the pair of mappings (a, a'), and resp. (b,b r ), only 
differs in the way the sets a([iV]) and a'([N]), and resp. b([N}) and b'([N}), are labeled. 
That is, 6(a,b) = 9(a',b') whenever there exist increasing bijections a resp. (3 from a([iV]) 
to a'([N}) resp. b([N\) to b'([N\) s.t. a' = era and b' = (5b. We write (a',b') = (a, 6) when 
this property is satisfied. The relation = is an equivalence relation. We also mention that, 
for a given pair (a,b), there are 

( N \ ( N \ _ {N) H (AT),,, 
\\a\) X \\b\)- \a\\ \b\\ 

pairs of mappings s.t. (a',b') = (a,b). Moreover, there is a unique pair (a',b') with the 
property a'([iV]) = [\a\] and 6'([iV]) = [\b\]. We use the familiar combinatorial terminology 
and refer to this process (the replacement of the set o([iV]) by [\a\], of a by a', and so on), 
as the standardization process (of subsets of N, of maps between these subsets...) Therefore, 
to evaluate 9(a, b), up to a change of indexes we can replace the pair of sets (a([iV]), b([N])), 
by the pair ([|a|], [|6|]), and the pair of mappings (a,c), by a unique pair of surjections 
(a',c') G ([|o|]^ x [|6|]t' a "). In other terms, the pair of mappings b < a is canonically 
associated to a pair of surjections 

[\b\]^[\a\]^ [N] 

To describe precisely the planar tree representations of the stationary population of the 
neutral genetic model (|2.ip . it is convenient to introduce a series of multi index notation. 

Definition 5.1 We letQ q be the symmetric group of all permutations of the set [q], and for 
any weakly decreasing sequence of integers 

q e Qn := {(qo, ■ j Qn) £ : go = N > qi > . . . > q n > 1} 

we use the multi index notation 

q! := [J q k \ (iV) q = \{ {N) qk and |q| := £ q k 

0<k<n 0<k<n 0<k<n 

We also introduce the sets 

d(q) := {c = (co, . . . ,c n ) G C n : VO < p < n \c p \ = q p+ i} and Q n (q) := Y\ Gq v 

0<p<n 

with the convention q n +i = 1. Finally, we denote by <S n (q) the set of all sequences of 
sujections a = (a , ■■■ ,a n ) G ([qi]^ x ... x [q n ]^ n -^ x [1]^) . 
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Running back in time the arguments given above, any coalescent sequence of mappings 
c = (co, • • • , c n ) G C n (q) is associated in a canonical way to a unique sequence of surjective 
mappings 

a = (a , . . . ,a n ) G S„(q) 
so that, up to standardization, we have that 

c = 7r(a) := (a , aia , a 2 aia , . . . , a n a n _i . . . a ) 

To put this again in another way, the coalescence sequence c has, up to standardization, the 
following backward representation 

{1} [q n ] ^ [gn-l] <— • • • — [ft] ^ [ft] ^ [iV] (5.1) 

In terms of genealogical trees, the mapping ao describes the way the N individuals in the 
present generation choose their parents among q\ ancestors; the mapping a\ describes the 
way these q\ individuals again choose their parents among q^ ancestors, and so on. 

From these considerations, by symmetry arguments, the invariant measure of Prop, 
can be expressed in the following way 

n>0qeQ„ aeS„(q) 



We also have the more synthetic representation formula 

(% 



F ^ F )=E^)T ^yr^(-)( F ) With 5:=U n > U qeQ „5„(q) 



and the multi index mapping p(a) := q, for any a = (ao, . . . , a n ) G <S n (q). 

Although this parametrization by sequences of surjective maps of the expansion of the 
invariant measure is already much better than the one in Prop. 13.31 it does not take into 
account the full symmetry properties of r„. To take fully advantage of it, notice first of all 
that the labelling of individuals in a genetic population is arbitrary. From the algebraic point 
of view, the invariant probability measure constructed in Prop. [3731 inherits this property in 
the sense that 

f,(F) = f ,{F°) 

for any a G Q n , where F a (x\, x n ) := F(x a m, ...,x a r n ))- I n particular, when computing 
T^F) we will always assume from now on that F is symmetry invariant, that is, that 
F a = F for all a E Q n -a property that we write F G B^ ym (E N ). If F does not have this 
property, its symmetrization F has it, where 



n .1 * — ' 



and the invariance properties of insure that T fJi (F) = T^(F) 

We then consider the following natural action of the permutation group Gq on the set 
S n (q), defined for any s = (s p ) < p <n G ^n(q), and a G <S n (q) by 

s(a) := (siaoSg 1 , . . . , cinS' 1 ) 

We let Z{a) := {s : s(a) = a} be the stabilizer of a. 

This group action induces a partition of the set <S n (q) into orbit sets or equivalent 
classes, where a and a' are equivalent if, and only if, there exists s G £/ n (q) such that 
a' = s(a). The set of equivalence classes is written / n (q). In terms of ancestral lines, the 
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genealogical trees associated with the sequences of mappings s(a) and a only differ by a 
change of labels of the ancestors, at each level set. By induction on n, it can be shown 
that each orbit, or equivalence class, contains an element in the subset 5^(q) C <S n (q) of 
all sequences of weakly increasing surjections. This observation can be used, together with 
standard techniques such as lexicographical ordering of sequences of sequences of integers 
to construct a canonical representative in S^(q) for each equivalence class. Also notice the 
mappings a p(a), and a i— > [J> n ( a ) are invariant; that is we have that p(a) = p(a'), and 
^■K{a'){^) = f Ji n(a.)(P) f° r an Y F £ B s b ym {E N ), as soon as a' = s(a), for some relabeling 
permutation sequence s. Thus, applying the class formula, we find the following formula. 

Proposition 5.2 For any function F G Bf ym (E ), we have 

F»(F) = Yl JzhM N \ P t)\ ^M a )( F ) 

aG«S' 1 y n 

with the set S' := U n >o U qe g n I n (q)- 

This functional representation formula can be interpreted in terms of genealogical trees. 
The precise description of this alternative interpretation is notationally consuming, so that 
it will be only sketched. The reader is refered to pp for details on the subject. 

Recall that a rooted tree t is an acyclic, connected, directed graph in which any vertex 
has at most an outgoing edge. The paths are oriented from the vertices to the root, and 
a leaf in a tree is a vertex without any incoming edge. Notice first that any sequence of 
mappings a = (ao,...,a n ) in S n (q) gives rise to a directed graph -defined as usual: that 
is, to any element k in [qi\ we associate a vertex of the graph and a directed arrow to the 
vertex associated to the element ai(k) in [<fo+i]. Two sequences in S n (q) are equivalent 
under the action of <5 n (q) if an d only if they have the same underlying abstract graph -see 
PP. It follows that In(q) is isomorphic to the set ^(q) of all rooted trees t, with height 
ht(t) = (n + 1), with g , ri _( p _x) vertices at each level p = 1, . . . , (n + 1), and with leaves 
only at level (n + 1). The set of all planar trees is denoted by T := U n >o U qG Q n 7^(q). 
Next, with a slight abuse of notation, for any choice a G S' n (q) of a representative of a tree 
t G T := U n > U qeQn T n (q) we set 

Att = ^(a) Z(t) = Z(a) and p(t) = p(a) 

In terms of genealogical trees, we finally have that 

teT 1 Wl 

A closed formula of the cardinal of the stabilizer Z(t) can be easily derived using the 
combinatorial techniques developed in the article pp. Firstly, we recall some notions. 

Definition 5.3 A forest f is a multiset of trees, that is an element of the commutative 
monoid on the set of trees. We denote by B(t) the forest obtained by cutting the root of tree 
t; that is, removing its root vertex, and all its incoming edges. Conversely, we denote by 
S _1 (f) the tree deduced from the forest f by adding a common root to its rooted tree. We 
write 

f : =t^...t^ (5.2) 

for the forest with the trees tj appearing with multiplicity nii, with 1 < i < k. When the 
trees {ti)i=\...k are pairwise distinct, we say that the forest is written in normal form. 
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Definition 5.4 The symmetry multiset of a tree is defined as follows 
t = B-\t^ . . . t™*) => S(t) := (mi, . . . , m k ) 
The symmetry multiset of a forest is the disjoint union of the symmetry multisets of its trees 

( \ 



S(t^ . . . O := S(ti), . . . , S(ti), . . . , S(t fc ), . . . , S(t fc ) 
y » ' v „ ' 

\ mi- terms m k - terms ) 



Following the proof of theorem 3.8 in pQ, we find the following closed formula 

\Z(t)\ = J] 8(3*0;))!, 

0<i<W(t) 
k 

where we use the multiset notation S(t)! := Y\ m i- if 8(t) = (mi, m^). The above 

i=l 

discussion is summarized in the following proposition. 



Proposition 5.5 



te j- JV LL0<i<ht(t) 
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